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* , <- . t $ ^ ^ 

Furthermore, although the primary structure ;s unrelated. the tertiary structure oi 
these two enayme families bung together a cons soma catalytic triad of amino acids 
consisting of senna nisiidine ano aspartate. 

* *r are serine proteases (approx. MW 27.503) which am secreted t 
large amounts Pom a wide variety of BaoiilUS species and otaer microorganisms 
f he r. rote i ss i termmed f rem a- 

vm . m m v m^and a b et at ■ 983;. Hoppe-Se-^er'sZ Phvso _ '>em 
364 > <■ - -> > ^ < at t 

BacOus > > w o Baconw /ehenec/ema sad severe: natural variants of 8. 

ientus have been reported These studies indicate teat although subtilisa; 
genetically unrelated to the mamma^n scene proteases, it has a similar active site 

eeay stsi structure su coma ova / Pcu 
ooo . h, "-oe- . ' t em - - " ^ ~~m44atrv 

product complexes ; Robertas J.D . el a! (1876) J. Siol Cnem. 261:109?- 11 03} 
have alec ; ao; ve s is ano putative substrate 

binding deft o sub sin O t e ertosr of kinetic a m w n o: 


WO 99/20769 


PCT/US98/22S0O 


-2- 

modlfioafion studies have been reported for suotiiisin ; Svendsea B, (1 078), 

1 r * - \> i t ^ <r S 3 

report wherein tne side chain of methionine at rescue 222 of subtiilsm was 
t ^ Ov c< ■> C i 

8 - v. <, < 244 •> \ , oet Vv ~sa 

been earned out (Weils and Esleli (1988) TIBS 13:291-29?) 

Summary of the invention 

it is an object herein ro provide a orotease variant containing a substitution of 
10 an amino ac;0 at ore or more residue positions corresponding to residue positions 
elected fro se grour s;shng of 6^ 
n> ' ? >t ac e t jtilisin 

■M <\ o ibinatior f th< ■ \ listed an io a 1 - sistutfc s may at 
r>ro«* ' ? c Tf prise 

1 § >* the 

residue positions correspond * positrons < Baaitus arrryloiiQuefaciens suotiiisin: 
(1) a protease variant intruding substitutions of the amino acid residues at 
position 62 and at one or more of the following positions 103, 104. 109, 1 SS\ 213. 
232 236.. 245, 248 and 252: 
20 (2) a protease variant including substitutions of tne amino acsd residues a? 

position 2 1 2 and at ore or n one of t 12, S8, 102. 103. 104, 1£ S 

232. 238 245, 248 and 252: 

(3) a protease variant including substitutions of the amino acid residues at 
position 230 and at one or more of the following positions 68, 103, 104. 150. 232. 
28 236 and 246; 

{4} a pros on? f nino add res-dues a 

position 232 and at one or more of the following positions: 1. 9, 12, 61 : 62 : 88, 76, 
9?. 98, 101, 102, 103, 104, 108. 130, 131, 169, 183, 185, 205. 209. 210, 212, 213, 
217, 230, 236, 245, 248, 252, 257. 250. 270 and 275; 

30 V f thtt ! t t eSiOUOS i 

position 232 and at one or more of the following positions 103, 104, 238 and 245; 

i a protease va 1 < a s - i d 

position 232 and 103 end at one or more of the following positions 1, 9, 12, 61, 62, 


WO «9/28?<>« 


rrCT/t;S98/22SO0 


-3- 

88, 76, 9?. 98. 101 102. 103. 104. 109. 130. 131, 150, 183. 135. 205. 209. 210. 212, 
213, 217. 230 ; 235. 245. 248, 252, 257. 260, 270 and 275; 

1 1 as v mi , suDstitutions of t no acid residues 

position 232 and 104 and at one or more of the following positions 1.9. 12. 61 : 62. 
5 68, 76, 97, 98. 101. 102, 103. 104, 109, 139, 131, 159. 183. 186. 206, 209, 210, 212, 
213, 217. 230. 238, 245, 248.. 252. 257. 260. 270 and 276: 

(8) > pro ~~~ * sussoics at 
x>$ri»on 232 ana 235 ana " » } N '< 1 

88.78. 97, 93, 101, 102. 103, 104. 109. 130, 131. 159. 183, 185, 205. 209, 210, 212, 
10 213. 217, 230, 238, 245. 248. 2S2. 257, 260, 270 and 275: 

(9) & profaa^ vanam including < r^ * s ot me amino ~- o i m > «. m 
position 232 aoa 245 and at one or more of the following positions 1. 9, 12, 81, 62, 
68. 76, 97, 98. 101, 102, 103. 104. 103. 130. 131, 153, 183. 185. 205, 209. 210, 212, 
2.13, 217, 230. 238. 245, 248, 252, 267, 280, 270 and 275; 

IS Q) a j t N tstuttons of s ss;d ues a 

position 232, 103, 104, 235 and 246 and atone or more of the foiicwmg positions i . 
9. 12. 81. 82. 88, 76, 97, 98, 101, 102, 103, 104. 109. 130, 131, 150, 183, 185, 205, 
209. 210, 212. 213. 217, 230, 236, 245, 248, 262, 267, 260, 270 and 275; 

> "v 0 , i PCS St. 

20 position 252 and st one or more c tne following positions 1 . 9, 12. 61, 62, 88, 97, 98, 
101. 102, 103, 104. 109, 130. 131. 159. 183. 136. 210. 212. 213, 217, 232. 236. 245. 
248 and 270: 

{12} a protease variarn including SuO^ivtson*. of the amino acid residues at 
position 272 and at one or mors of the following positions 103. 104. 236 and 245: 
28 (13) a protease vanaot tncmotng suosotufions of the amino sod residues at 

positions .252 and 103 and at one or more of the following positions 1 , 9, 12, 61, 62, 
68, 97. 38. 101, 102, 103, 104, 109, 130, 131, 159. 183, 185, 210, 212, 213. 217. 
232, 236, 245, 248 and 270. 

'4 a protease variarn :naoono suostikmons o the amino acio residues at 
30 positions 252 and 104 and at one or more of me following positions 1. 9. 12, 81, 62, 
68. 97, 88, 101, 102, 103, 104., 109, 130, 131. 159, 183, 185, 210, 212. 213. 217, 
722 236. 245. 248 and 270; 

(15) a protease variant including substitutions of the amino acid residues at 
positions 2S2 and 236 and at one or more of the following positions 1 , 9, 12 81 , 62, 


WO 95/20769 


68, 97, 88, 101, 102, 103, 104, 109, 1 3D, 131, 159, 183, 185, 210, 212, 213. 21?, 
232. 236. 245. 248 ana 270; 

(18) a protease variant including suostitutions of the amino acid residues at 
cor* ns 252 and 245 « la one * ^ 12 87 d2 

5 68. 97. 08. 101. 102. 103, 104. 109. 130. 131. 1§9 : 183. 185. 210. 212, 213, 217. 
232. 235. 245, 245 ana 270; 

(17) a protease variant including substitutions of the amino acid residues at 
positions 252. 103, 104, 236 and 245 and at one or more of the following positions 1 
3. 12, 81. 62. 68. 97. 98. 101, 102, 103, 104, 108. 130. 131, 159, 183, 185. 210, 212 
10 213. 217. 232, 238, 245, 248 and 270: and 

> v 3. ^ < c ua*ng si tvStit* I > * 1 

position 257 and at one or more of the following positions 68, 103. 104. 205, 209, 
210. 232. 236. 245 and 370 More preferred protease variants are substitution sets 
selected from tne group consisting of residue positions corresponding to positions sn 
i § t is \ 
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st is a further object to provide DMA sequences encoding such protease 
variants, as well as expression vectors containing such variant DNA sequences. 

Still further, mother object of the invention is to provide host ceils 
transformed * s *ectc js f ha spabie of expressing 

such DNA to produce protease variants either intraceiiuiariy or extraceiiularly. 

There ,e< further provided a cleaning composition composing a protease 
variant of the present invention. 

Additional there is provid - ? a protease variant 

of the present invention. 

%UH\. ' "1 ( < 

protease vanaot of the present invention. 

M C-e-HE DRAWINGS 

Figs. 1 A--C depict the DNA and amino acid sequence for Saaffm 

i f ! s gene. 

Fig. 2 depicts the conserved amino sad residues among subtiiisins from 
BBC'lus a;- > . {BPH}' and Bacsilus i&nius (wild-type). 

< - , 5 r v ^ r ! $ ' > 

line represents the amino acid sequence of sybtUistn from Bacillus amy!oiiQvefacien& 
sobtiiisih (also sometimes reierrsn to as suof^s-n SPNh The second iine depicts the 
amine acid sequence of subtilisin - Sao-hes cubb/m The < ? iirie depicts the 
amino acid sequence of subtisin from 8. iwnenifonm The fourth sins depicts the 
p 3:^t\f''* i ■■- ' >i 

309 in PCX WO89/08276). The symbol ♦ denotes me absence of specific amino add 
residues as compared to subhlisln BPH". 


Detailed Description of the Invention 

Proteases are - s 
bends of protases m t d As used > protease" means a namr t\ v- 
30 occurring protease or a recombinant nrotease Naturally- occurring proteases include 
a~amineaey1peptide hydrolase pet* c >c v < ~ ~f hydrolase 

serine carboxyoeo: case, metailo car boxy peptidase, thiol proteinase, 
ear boxyi proteinase and metailoproteinase. Serine, metailo, thiol and acid proteases 
are included, as well as endo end exo*proteases. 
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occurring esrdonyl hydrolase variants ipromsse variants; having a different 
i C , U shsu U - . M 5\!- 

characteristic 38 compared to the precursor caroony; hydrolase from which the 
5 amino add sequence of the variant ss derived. Specifically, such protease variants 
have an ammo acid sequence not found in nature, which is derived by substitution of 
a plurality of amino acid residues of a precursor protease with different amino acids. 
The precursor protease may da a naturally-occurring protease or a recombinant 
protease. 

10 The protease variants useful herein encompass trio substitution of any of the 

3iu r , < eo amino acre res due 

positions * s an oe made in any precursor subtliisin tprocaryocc. 

e imaiian. etc 5 application re ere see s Ytade to 

various amino acids by way of common one - and three-tetter codes. Such codes 
15 are identified in Dale. fVI.W. (1989 Select 1 f Bacteria. John Wiley S 

Sons, Ltd., Appendix B. 

The protease riants >t i from a Baciftm 

subtiiisin. More preferably, the protease variants are derived from Bacilius lentus 
i&l'S ~ 

20 Subiilsslns are bacterial or fungal proteases which generally act to cleave 

peptide bonds of proteins or peptides. As used >■<■!<..-!■ he .■- means a naturally- 
i cjot! on 1 c e s * 1 > v. « , , < f 

v s s know 3 1 produced and o en secretes. ari n aroblai species 
Amino acid sequences of the members of this series are not entirely homologous. 
2S H «vj m \ n * > > *U , - re sni^ of pm 

ctlviiy. This r * - acid saquenc 

defining a catalytic triad which distinguishes them tram the chymotrypxin related 
class of serine proteases. The subtilisins and chymotrypsin related serine proteases 
both hovu a » 1 in the subiiilsln 

30 related proteases the relative order of these amino acids, reading from the amino to 
t te i v j > e-sem 1 the ivn * u js retetei 

proteases the relative order however is his >> s sspabats - rone Thus, subtilissn 
here refers to a serine , - 1 the catalytic inaa of so;, i sn e alaa 
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proteases. Examples include out are not limited to the subtiliskm identified in Fig. 3 
herein. Generally and for purposes of the present invention, numbering of the amino 
adds in proteases corresponds to the number* assigned to the mature 8ae///us 
ytoiiq t ' »u sin seque presented Rg, 1 
5 "Recombinant subtisin" or ^recombinant protease" refer to a suotiHsin or 

protease m * < * >. < ee< s ^ -mm ^ 

to produce a vacant (or mutant; DNA sequence which encodes the substitution, 
deletion or insertion of one or more amino acids >n the naturally-occurring amino acid 
sequence. Suitable methods to produce soon modification, and which may be 
10 combineo vmh fnose Ciseosed naresn. include -s >- in US Patent RE 

34,808. US Patent 5.204.015 ana US Patent S.I 85.258, U.S. Patent 5.700.575. U.S. 
Patent 5,801.038. and U.S. Patent 5,763,257. 

"Non-human subtifis!ns ,: and the DNA encoding them may be obtained from 
i is ot pmi.a oti 

15 organisms include gram negative organisms such as E coil or Pssudommoas and 
i ucaryotic 
organisms irorn which suOtiiisin snd their genes may he obtained include yeast such 
as SaGcftaramyc&s cem^sfee. mngi seers as Aspergillus so. 

A "protease variant" has an amino acid sequence which is derived from me 
20 aeeno acsd sequence of a "precursor protease 1 '. The precursor proteases include 
naturally-occurring proteases and recombinant proteases The ammo acid sequence 
of the protease variant =s "derived" from the precursor protease amino acid sequence 
by the substitution, deletion or insertion of one or more amino acids of toe precursor 
amino acid sequence. Suon rnodific V susnce" which 

25 encodes the amino acid sequence of the precursor protease rather than 

.mm, com "\ . iw . c ^ ^ * methods fm such 

>n r r , N v J 3 " ar e r as 

well as methods known to those skilled ;n the art (see, for example. EP 0 328299. 
"* "o-iee.n 
30 Specific substitutions of amino ac;ds at one or more residue positions 

corresponding to residue positions selected from the group consisting of 62, 212, 
230, 232. 252 and 25? of 8aa!!us > -uc r ac,cn& sut\ are identified herein. 


WO $9/2t>7S$ 


pcr/i nso 


•34- 

deferred vana . f havng ib * * - s at r©< 
posuonscc"*^ ^ ^ >"-.. ^ >^ x -nr Table 

1. 

More preferred variants ara those having combinations f substitutions at 
5 residue positions corresporo ~ o;^ f , ^ ! > s 

Table 2. 

Further preferred variants are those navsng comDinauons o; substitutions at 
residue posdsor s cooes ) r )f 8 5 suouitsm 

labia 3, 
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These am n v . ac o sbers refer to those assignees to me mat m 

Ld ^ -> „ j "he' 1 °v o v o* 

ho , - i e sub-tlHoin fc jxte ds to 

pr&cam proteases cent § which are 

S "equivalent" to the oertiomar identified residues in Bacillus amyloHqu&f&cmns 
suOliSisin. In a preferred embodiment of the present invention, the precursor 
protease is 8aciHas tenuis subtslisiu and me substitutions are made at the equivalent 
amino acid residue position? - o< j to t m listed above^ 

A residue (amino assd) position of a precursor protease is equivalent to a 
10 ^ - > * ~ , ^ , - m; * -■ \m > 

corresponding m ■ osition in estner primary or tertiary structure) or analogous m a 
specif .c res due or porucn of thai ret 

having the same > ruler funchonai capacity t ami e. n v r niereet 
chemically). 

1$ In order to establish homology to primary structure, the ammo acid sequence 

Of a precursor protease is directly compared to the Bacillus amyloliqu&f&dens 
sudtilisin prsmary sequence and particularly to a set of residues known to bo invariant 

- " m - n st Ot: 

o ^ v x ^ 

20 subtilisin. After aligning the conserves residues, allowing for necessary insertions 
and deletions in order to maintain alignment (I.e.. avoiding tne elimination of 
-onsprv* t ; 1 , r en u, ; - - " ^ o-c eu^ u<" 

to pedicular amino acids in the pnmary sequence of Bee .•>)•>;(.• ■•su^faams 

>referal olo d conserve 

25 100% of soon residues However alignment of greater than 75% or as little as 50% 
-/ sense \m o v. ,o^ maeoi 

of the catalytic it an Psp32 HjsS4 Ser22l should be ma ntained Sieaen el al 
(1991 ; Protein Eno. 4I7V.71 9-73? shows the alignment of a large number of serine 

30 proteases. 

- d > w r. a n 1 * a r 

amyioiiquBfad&m, Bacillus subtilis, Bacillus ruwemfooms {carisbergenss$} and 

sarsaligr I >vid« s <■> maximum amom h ry bs wear 
amino acid sequences. A compadson of these sequences shows that there are a 
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numbsr of conserved residues contained in each sequence. These conserved 
residues (as between 8PN' end 8. /emus) are identified in Fig, 2, 

These conserved rescues, thus, may he used to define; the corresponding 
equivalent amino asm residues of Bacillus atnyiakQueksams subtihsin m other 
5 subtiissins sucn as suotTsir rrom Bgcwus i&mts (PCX Publication No VV083/QS279 
published July 13. 1989). the preferred protease precursor enzyme herein, or the 
subtilise referred to es P892 (EP 0 32S 299). which is highly homologous to the 
referred £ iilush t ssubtiiss he a u acid seque esc ertain of these 
suPtilisins are aligned in Figs. 3A and 3B with the sequence of Bsaiii/s 
10 Bmyloiiquefaciens subtiiisln to produce the maximum homology of conserved 
residues As can oe seen there are a number o\ deletions in the sequence of 
Bacf&u.s Zooms- as compared to Bacons amy/o/icuefacrens * >- r Thus, for 

j n -50 as o foj Vs II SS is ? * suet is n 

« s btiiisins s isoieucsne for 8 mtus and i ? ? 

15 cusva^ < < t homology at the 

level of tertrary structure for a precursor protease whose ternary structure has been 
determined by x-ray crystallography. Equivalent residues are defined as these for 
which the c < jtoms of a particu or 

, < O i ^ * _ 

20 SUbtilisin (N on hi CA on CA. C on G and O on O) are within 0.1 3nm ana preferably 
O.lnrn after alignment. Alignment is achieved after the oest model has been 
-mcmed - t^o nort- 

wojoge ") * - ' ! 

subtil sm < - 

25 a able 


Equrvait 5 pacific ressd m 

ens subtHtsm are defined as those amino acids of the 
precursor protease which may adopt a conformation such that they either alter, 
modify Of contribute to protein structure, substrate binding or catalysis in a manm t 
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defmsd and attributed to a specific residue of the Bocfe amyk>mu»^ci&o$ 
su«ia Further, they are those residues of the precursor protease {for which a 
tertiary structure has been obtained by x-ray crystallography) which occupy an 

n to the extern that, although the main chain stores of the given 
s criteria of equivalence on the oasis of occupying a 

s of at least two of the side chain atoms 
of the residua i;e with ion t U t i 1 tede >' " 3a xitus 

The coordinates of the three dimensional structure c? 
s subtilissn are sot forth in £PO Publication No. 0 251 448 
1 0 (equivalent to US Patent 5. 182.204. the disclosure of which is Incorporated Herein by 
reference} and can be used as outlined above to determine equivalent residues on 
the level of tertiary structure. 

Some of the rescues ideomieo tor suDSiitaeon are conserved residues 
whereas others are not. in the case of residues wh;ch ore not conserved the 
1 5 suostinit.cn of one or more amino acids is limited to substitutions which produce a. 
variant which has an amino a«d sequence that does not correspond to one found in 
nature. In the case of conserved residues, such substitutions should not result In a 
natural > cc ' r sequence. The prorea.se variants or the present invention include 
the mature forms cf protease variants, as well as the pro- and prepro-forms of such 
20 protease variants The prepro-forms are me preferred construction since this 
fa 1 hes fhe express or sec; ihion and maturaticr f n e -"tease variants 

I 'rcsei t em u sn ends Count I N -tormina 

pert or of the mature form of a protease wntch ween removed results in the 
appearance ef the "mature" form of the protease, Many proteolytic enzymes are 
25 found in nature as transnational proenzyme products and, so the absence of post- 
transiatiooal processing, are expressed in this fasnion A preferred prcseeuence for 
producing protease variants is the putative resequence ef Bao&s 
amyioliquefaaens subtlfisir;. slthougn other crcteeee p-oseQuences may ee used. 

A "signal sequence" or "presequence" refers to any sequence of amino acids 
30 bound to the N-terminal portion of a protease or to the N4erminal portion of a 
propmtease which may participate In the secretion of the mature or pro forms of the 
protease. This definition of signal sequence is a functional one, meant to include all 
those amino acid sequences encoded by the N-terminal portion of the protease gene 
in the effectuation of the secretion of \ 
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conditions. The present invention utilises such sequences to effect the secretion of 
the protease variants as defined herein One possible signal sequence comprises 
the first seven amino acid residues of the signs; sequence from Bacillus subtil® 
subtsllsm fused to toe rem*«def of the signal sequence of the subtslisiri from Bacillus 
5 lenius {AJQO 21536). 

A "prapro" form of a protease variant consists of the meters form of the 
protease having a prosequence opsrably linked to the amino terminus of the 
protease and a "pro or "signal" sequence ooerabiy linked to the amino terminus of 
the prosequence. 

10 "Expression vector" refers to a DNA construct containing a DMA sequence 

which is ooerabiy t -co to a to c control sequence capable ct effecting the 
expres ic-n of sasd NA s suilal wsf Such control segue ?< tie; 
promoter to effect transcription, an optional operator sequence to control such 
transcnptio t a sec, snt ; sustai ? RN N > sornr binding s tes and 

15 sequences which control termination of transcription and translation. The vector may 
be a piaamid, a phage particle, or simply a potential genomic insert. Ones 
transformed into a suitable host, the vector may replicate and function independently 
of the host genome, or may. in some instances, integrate into the genome itself, in 
the pr~~ 

20 <t f < f > s m itr^f f 

However, the invention is intended to include such other forms of expression vectors 
which serve equivalent functions and which are. or become, mo^n ;n tne art. 

. gene ally pr coary 
eucaryetic hosts which nreferaoly cove been manipulated by the methods disclosed 
25 in US Patent RE 34.606 to render them incapable of secreting enzymatlcaiiy active 
endeprotease. A preferred host „ for expressing prote- ~ f - > '<* 

^ ~„ q^nemt m k-, , \ 

proteasr suhciisi f - G203 ~ s x detail in US 

Patent 5,264,366. Other host cells for expressing protease include Bacillus V« 
3D 1168 (also described in US Patent RE 34,605 and US Patent 5,264,355, the 
- e of v\hcr a^e incorporateo h^re or, e*~ ?nce , - >\ 
Bacillus strain such as 8. kenemfomiss B. Imitus.. etc. 

Host ceils are transformed or transferred with vectors constructed using 
recombinant DMA techniques. Soon transformed host cells are capable of either 
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ating vectors encodir ise va mts em * seeded 

protease variant, in the case of vectors wnich encode the pre- of prepro-f orm of the 

tease ? a s =a > *■ f 

celf into the host cell medium. 
5 "Operabiy linked. " when describing the reigtion$nip between two DMA 

regions, simply means, that they are functionary related to each other. For example, 
3 presequenee is operabiy linKod to a oeotide if it functions as a signal sequence, 

rtlcipatsr ts sect - n most crobab 

involving eieavage of the signal sequence. A promoter is operabiy linked to a coding 
10 sequence if it controls trie transcription of the sequence; a r iboscme binding Site ss 
operabiy linked to a coding sequen e j * „ transta 

The genes encoding tne naturaiiy-occumng precursor protease may be 
obtained in accord with the genereu metnoos known to those skliied in the art, The 
methods generally compose synmesuong iadeied probes having pui^e sequences 
15 encoding regions of the protease of interest, creoanng genomic libraries from 
organisms expressing the protease, a no screening the libraries for the gene of 
interest by hybridization to tne probes Positively hybridizing clones ere then 
mapped and sequenced. 

to OADress 

20 * » i s h copy number c-ussmid 

! * ' «> > 1 iV"i *< 

ssar\ i he a=oe in 

estlon {w y b* 58 tt v <> ' v lt( * i 

ecogn , trans Ded. by the nost ttansonpt ; ?e -sronend 

25 poiyadenylation region •: necessary for stability of the mRNA transcribed by tne host 
from the protease gene in certain eccaryotic host celts) which Is exogenous or is 
suppi ed by the endogs * ees rafoly 3 

sesocbonqo - 

maircenai-" - 1 «. . - m v - < *~ a > 

3D m v nber s t e s a ontwn a t i f > el 

enabling large numbers of ptasrrslda to be generated In the cytoplasm without 
chromosomal limitations. However, if is ; within the scope herein to integrate multiple 
copies of the protease gene into host genome. This is facilitated by prccaryotrc and 
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ti >~o vet r ovnnfsrrs wretch * - 
recombination. 

The gene can be a natural 8. /endes gene. Alternatively, a synthetic gene 
encoding a naturally-occurring or mutant precursor protease may oe produced in 
5 such an approach, the DNA and/or amino acid sequence of the precursor protease is 
determined. Multiple, overlapping synthetic $;ngie-stranded DNA fragments are 
it a synihet r 

DNA encoding the precursor protease An example of cwnt.nahc gene construction Is 
set forth in Example 3 of US Patent 5,204.015 the disclosure of which is 
10 Incorporated herein oy reference. 

Once the naturally-occurring or synthetic precursor protease gens; nas been 
eieeee. a number ci modifications are undertaken to enhance the use of the gene 
bev -*nn synthesis of the n ig csor profess s nodiftca N iv- 

include the production of mcomOinam proteases as disclosed Iri US Patent AG 
15 34.606 and PPO Publication No. 0 251 ^co and the production of protease variants 
described herein. 

The following cassette mutagenesis method may be used to facilitate the 
construction of the oroteasc variants oi the oresent invention, although other 
methods may be used. First, the naturally-occurring germ encoding toe protease is 

20 ob - yo md sequem c- N c * 2 is scannc for a 

point at which it is desired to make a mutation {deletion, insertion cr substitution) of 
one or mors amino acids in tne encoded enayme The sequences Hanking this point 
are evaluamc tor the presence oi restriction sees ?o ; replacing a shorr segment of 
'io qo*e o< 11 ' " " , ! . < oe various 

25 - r*< c, m, v m on ase ^ne 

so as to iaollitate tne replacement oi the gene segment Howeve- . any convenient 
restriction site which Is not overly redundant in the protease gene may be used, 
provided tne gene fragments generated by restriction Oigesi;oo can be reassembled 
in proper sequence, P restriction Sites am not present at locations within a 

30 convenient distance from the selected pom! {from 10 to 15 nucleotides), such Sites 
are generated by substituting nucleotides m the gene in such a fashion that neither 
the reading frame nor the smsno acids encooee arc changed in the final construction. 
Mutation oi the gene in order to change its sequence to conform to the desired 
sequence is accomplished by M13 primer extension in accord with generally Known 


1 j< 


methoos. Trie tasn ot socating suitaose tsanfeng regions ana evaluating ti 
changes to arrive at two convenient restriction site sequences is made routine by the 
reounoeney ot the genetic cede, a restriction enzyme map ot tee gene ana tee large 
number ot ditferent rest not son enzymes. Note teat c a conven; 
site is available, tee above methoo need be used only in connection v 
reman vOncH coec no: careen a see 

Ones tee nmuraiiy-occurnnc OHA or synthetic QUA '--'a cioneo. tno restriction 
sees tiankmg tee positions to be rm natea are digested vath a cognate restriction 
erotvmes and a plurality of end terrniriecomoienieraary ceioomocieohde cassettes are 
ligaied into the gene The mutagenesis is simplified by this method because all of 
n " $ba.> arc 


tea oste ot ' ' \oS ot 
peptide bonds per milligram of active enzyme. Many weii known procedures exist for 
measuring proteoiyac activity ;K M Kate, "Microbial Ptoto :sss c £ syag a r i 
Biochemlcjl^ A. Fsechter sd.« 1988}.. in addition to or as 

an alternative to modified proteolytic activity, the variant enzymes of the present 
invention may nave other modified properties soon as K m , p,,., k ;v a'Kb, ratio and/or 
modified suostram specificity and/or modaeo pH activity profile. These enzymes 
<.-meeta ~-m - * o ■> * < ~ . *~ i m, 

example, in the preparation of peotides or for hydrolyttc processes such as \aum?y 


In cms asc 

having altered, preferably improved wash performance as compared to a precursor 
25 protease in at j ear a? least >ne set ot wash 

cepditioos, 

Tnere is a variety a; wasn conditions mcmoirm veerying detergent 
formulate \ volume, wash \ /ssn cm; 

that a protease vanam msgnt ce exeosed tc Per example. detergent formulations 
aO usee ;n different areas nave deferent concentrations of teen relevant components 
present in the wash water. For example, a European detergent typically has about 
4500-5GD0 pprn of detergent components tn the wash water while a Japanese 
detergent typically aas approximately 55 r ppm a? detergent components in fee wasn 
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water, inNc 

tr ' \ ;- r " • ■ •> . s 

A iow ci&tergent concentration system mciodes detergents wherr 
about 800 ppm of detergent components are present >n the wash water. 


approximately 60/ ppcc ot detergent components present j r j the wash water. 

,«.. eteoiurn oetergent concentration memoes detergents wnare oetween about 
800 ppm and about 2000op n oi detet > j a ese the wash 

water North American detergents are generally considered to he meowm oetergent 
10 concentra s s they nave approx detergent 

components present in the wash water, eras! typically has approximately 1500 ppm 

A Won detergent concentration system includes detergents vocem greater 
than about 2000 poet of detergent components are present in the wash water, 
15 „ e i to be p e ?. 0; n oncen tit 

3 <. 1 i 1 o e*. <> m no ~e ^m n ^ 

wash water. 

Latin American detergents are generally high suds onosphate builder 
detergents and the range of detergents used in Latin America can fait in both the 

20 medium and high detergent concentrations as they range from 1500 ppm to 5GQ0 
ppm of detergent components ;n the wash water As mentioned above, Brazil 
typically has approximately 1500 ppm of detergent components present in the wash 
water. However, other high suds phosphate builder detergent geographies, net 
■OS ~ i i scent etto' s 

25 systems up to about 8000 ppm of detergent components present m tne wash water. 
In light of the foregoing, st - evident that concentrations of detergent 
compositions ^ typical wasn soinnons tnroognout the wono vanes tram iess tnan 
anoi.it BOO ppm ot detergent composition {'low oetergent concentration 
gee graph -cample abo » 007 ppm in Japan to as we« 1 abc it 8C C pp n 3 

30 about 2QGG ppm (Medium detergent concentration geographies!, for example about 
075 ppm in U.S. and about 1500 ppm in Brazil to greater than about 2000 ppm 
Thigh detergent concentration geographies"), for example aboot 4500 ppm to about 
5000 ppm in Europe and about 6000 ppm in high suds phosphate builder 
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I he concentrations of the typical wash solutions are determined empineasly. 
For example, in the U.S., a typical washing machine holds a volume of about 64.4 1 
of wash solution. Accordingly, in order to obtain a concentration of about 975 ppm of 
detergent within the wash solution anon; 62 79 g of detergent composition must be 
5 added to the 64 A I of wash solution. This amount is the typical amount measured 
into the wash water by the consumer using the measuring cup provided with the 
detergent. 

As a further example, different geographies use different wash temperatures. 
The temperature of the wash water in Japan is typically less than that used in 
10 Europe. 

Accordingly one aspect or the present invention includes a protease vanam 
mat shows improved wash performance ;r; at least one set re wash concisions 

in another aspect of the invention, It has ten determined that substitution of 
an amino acid or one or more residue positions corresponding to resioue positions 
IS selected from the group consisting of 62. 212. 230, 232, 252 and 257 of gpeftw 
> > ying s nee of toe 

enzyme. 

These substitutions are oreferabiy made in Bacillus lenius (recombinant or 
nativeuypet suohiisin. airhougn me substitutions may oe maua in any Bacillus 
20 protease. 

u,^ ^ e w i - i > - n^ w< 

mutations in BsaUus fiivyioiiquefaaenn suotilism are imoortant to trie proteolytic 
activity, performance emoor stability of mese enjymes end the cleaning or wash 
hone a 5 - zymes; 
25 Many Of the protease variants of the invention are useful In formulating 

vanous oefergent oornpositions or personal care formulations such as shampoos or 
lotions. A number of known compounds are suitable surfactants useful in 
composseons comprising the protease mutants ot the invention;. I heso income 
nonionlo : anionic, catiemc, or zwltferiomc tttergems as disclosed ;n US 4 404 12S 
30 to 8arry J. Anderson and US 4.261 .BOB to Jin Flora, et ai. A suitable detergent 
formulation is that described In Example 7 of US Patent 5,204.015 (previously 
incorporated by reference}, f he art is familiar with toe different formulations widen 
can he used as cleaning compositions In addition to typical cleaning compositions, 
s re a * s . at t >, „ ants of t press . ; a jy b« 
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■47- 

ueed for any p irpose that native or wsm * ss are t sed T! t & t e 

variants can be used, for example, in bar or liquid soao applications, dlsheara 
formulations contact tens. ciean-r.g solutions cr products peptide hydrolysis, waste 
treatment: textile applications, as tusion-Oieavage enzymes in protein production: 
5 etc r h~ variants of the present invention may compass enhanced performance in a 
detergent composition (as compared to the precursor}. As used herein, enhanced 
performance in a detergent is defined as increasing cleaning of certain enzyme 

jrvs^va steins s - rmco as : i Oy usua; eva ton after s 

standard wash cycle, 

10 Proteases of the invention can be formulated into known powdered and liquid 

detergents having pH between 8 5 and 12.0 at iev#§ of about 0.01 to about 5% 
(preferably 0.1% to 0.6%) by weight. These detergent cleaning compositions can 
also include other enzymes such as Known proteases, amylases, celluloses, lipases 
or endogiycosidases, as well as builders and stabilizers. 

15 - ' 1 eaning 

compositions does not create any special use imitation, in other words, any 
* «. 5 for the present 

compositions as long as toe pH is within the above range, and the temperature is 
below the described proteases denaturing temperature. In addition, proteases of 

20 the invention « * t ' ! } s n ' 

a'onr. m * ~< 

lay ~ ! ti iai 

contain add.hves whom are commonly used in cleaning compositions These can be 
25 selected from, but not limited to, bleaches, surfactants, builders, enzymes and 

beam; csmkais it would ne moth accmem to one of ordinary sl-cll in the ad what 

additives are suitable for tncfusion into me compositions. The list provided herein is 

by no means exhaustive and should be only leeeo as <nr: >m of suitable additives, 
v - s- r e -\ o c > s<r - - 

3D additives which are compatible with the enzymes and other components in the 

composition for examp surfactant 

When present, the amount of additive present In the cleaning composition Is 

from about 0.01% to about 99.9% preferably about 1% to about 95%. more 

preferably about 1% to about 80%. 


WO 9$/20?$? 


-a- 

The variant proteases of 1 present invention can be included in snsmaJ feed 
such as part of artimai feed additives as descried in. for example. US 5.612.055; 
US 5,314.692: and US 5.147,642. 

One aspect of the invention is a composition for the treatment of a textile that 
S includes variant proteases of the present invention. The composition can be used to 
treat for example silk or wool as described m publications such as RD 218.034: EP 
134.267: US4.S33.350: and EP 344.259. 

The following is presented by way of example and is not to bo construed as a 
limitation to the scope of the claims. 
10 Alt publications and patents referenced herein ere hereby incorporated by 

reference in their entirety. 

gxample 1 

A large number of protease variants were produced and purified using 
15 methods well known in the an. Ail mutations were made in BacSfos tentus GG36 
subtilisin, The variants are shown in Table 4. 
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A large number of the proieass variants produced in Example 1 were tested 
for performance in two types of detergent end wash conditions using a mteroswatcb 
5 essay describee in An improved method of assaying for a preferred enzyme and/or 
preferred rerge c ~>«8 796 

Table 6 i;sts the variant proteases assaved and the results of testing in wo 
different detergents. For column A. tee detergent was 0.6? g/s filtered Aries Ultra 
(Procter & Gamble. Cincinnati. OH, USA), h a solution containing 3 grains per gallon 
10 mixed Ca 2+ /Mg 2 * hardness, and 0,3 ppm enzyme was used in each well at 20*C. 
For column B, tne dstorgsm was 3.38 g/l filtered Anal Feres (Procter <ix GarnPle. 
Cinclfmat OH, US *7Mg 
.hardness, and 0.3 ppm enzyme wae used m eaen well at 40*C. 
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Example 3 

Tabs© 6 lists the vanam proteases assayed from Example 1 ana the results of 
testing in four different detergents ?ne san s t „ 

5 were done on 5 e noted s o* 3 detergents Fa ium 

A, the aetetg^rt vm < 3 ™tHp 2 :n a v OH, 

USA}.. In a solution containing 3 grams oar gallon mixed Ca^/Mg 24 " nardness, and 
0.3 porn enzyme was used sn eaen weii at 20X. For column B. the detergent waa 
? g ^ e <> G > > C m : nnat O ■ -> a a * ^tii 

10 containing 16 grams par gallon mixed (AA'A'Mg^ hardness., and 0 3 ppm enzyme 
was i sea " , ( f roctsr 

Gamam, Cincinnati. OH. USA), m a solution containing 8 grains per gallon mixed 
C U s s 3 J ' f a 

column D, 1.5 m\l\ Tide KT detergent (Procter & Gamble, Csncrnnati. OH, USA), in a 
IS solution containing 3 grains per gallon mixeo Ca^/Mg 2 ' 5 * hardness, and 0.3 ppm 
w i < ! ai20*C- 


T 


"s""3" 


itiwt! 


fTiif 


■ 111 


T 


I 


iiiiiilliil 


s 1 S I S 


Bill 


Is 


SI! 


SI! 


r 


Si 


I ill ll S 


!IS 

111 


ill 

1 M.HH 


WO 99/30769 


84 


o 

o 


o 


16 


(X 

SO 
sO 

5* 

55 

" 


s-s-.s 
X> 


5 

B 










O 










,. 

£ 


*x 

iO 
i.A 


o 

05 

X.i 

■N 
xj- 

o 

xf 


as 


i 





■ 

!» 

CO 

n 

CO 
tO 

<x 

CO 
X 


O 

o 

sr 

o> 


SO 


8 - 


*r 

o 






o 

o 




o 








§ 


EN 



5 



2.52; 

3 

«3 



?X 



sO 























































ITS 











rc 

> 


<B 

1 

6 

QS 

>o 
c\ 

Lo 


z 

[ 

io 
Cx 

xt 
iX 

z. 




1 

!X 



z 




z. 





Z 

CO 


Q 

03 

5? 

i 


D 

Q 

Q 

D 


?x 

«-) 

G 
so 

cx 

1 

s 

-xi 

(X 


<x 

§ ! 






p: 


{X 






'X 



£; 

L; 


-z 






u> 

So 




I 


ti 


Q 




Q 

1 

1 

| 

| 



"* 

1 



•5 





x ; 

f 





Q 



pi 





0; 




xi 

























x; 

cx 




<o 

M 


1 





1 

1 

I 

o 

a 

>;x 

a 

a 

a 

0 

CN 

O 

a 

a 

a 

o 

C 

a 

(X 

a 

a 

8 

q 

o 


1 








X 




f 





1 




CN 

CN 

q 

P. 

P 




:: : 


% 


1 

Pi 



x ! 


<<} 

< 

< 


< 

a 

0 

< 

G 


a 

G 

o 


CO 



f 

g 


CO 


1 

> 
< 


to 








§ 

cx 











1 

3 

iX 

| 

a; 






o; 

OS 

— 

& 





1 


| 


X 

$ 

1 

1 

Cc< 







1 




T4. 


(!) 

o 


G 

o 

G 

5 

O 

> 

O 


:.. 


G 


6 

T 
a 

1 

I 

S 



1 

e 

i 

; 

a 

% 


O 


1 







> 

> 

> 

> 



h 




I 






< 



% 
o 



< 


< 

< 





•:x 





° 

a 

cx 
o 

:: 

o 

C5 


o 

o 

o 

o 


o 


o 



to 

so 

69 

CO 

CO 


0> 


SO 

sO 

69 

69 

CO 

CO 

< 

s 

< 

1 

< 

< 

a 

5 


< 
1 



69 

1 



< 


< 

> 



1 

> 

co 
> 



5.? 

CO 

> 

> 


wo mimm 



8 

3 

§ 


s 

8 

3 


a 


a 

s 

a 

EN 

a 

a 

1 

CN 

E 


~ 


9 

g 


9 

9 

z 

a 

3 
<s 

9 

z 

Q 

z 

9 
z 

Q 
z 

9 

9 
z 

s 




s 

fN 

CN 

s 


5 

5> 
o 


2 


a 

8 

| 

OS 

? ' 



5 

s 

«> 

**> 


« 



m 

s 

1 

5 

« 

a 

CN 

a 









i 






































1 

















li 

I 





1 

1 

1 



1 

1 

l 


11 

1 


1 

! 

1 

s 

1 

it 

1 

1 



1 

1 

I 



I 

1 



wo nmm 


% 3 5 5 S ! 


? s 1 1 S r. I 1 S 5 * ! 


ill 


I I I i I i 


liiliiiill 


i 8 i 


I 1 S s § I 5 I § I 5 § 2 


<S W n d 


- = 


lis 


IS ill i: : 


\ ISIISi 


inn 


in 


in 


s 


ISS1I 


III 


IEI 


ill i ii i if 


ii 


■ 3 S S 

a \o \> \o o 


ii 


II SI! 


III!! 


0 


in 


1 


WO 5^/3076$ 


I 

8 111 


5 s 5 s : 


2 5 


« s § 


lis s 


ill 


s g 8 e a 


i § § § 

!?¥■ 


i 


I II ill II II III ill II is 


Sill 


linn 


IS I 


s 


III 


s 


S! 


i 


? 


in 


III!! 


!! 


il 


111 


UI 


s 


in 


i£i 


I 


ill 


n 


WO 99/28769 


§ 1 § II I § § § § I § I 
3 


I § g g S § 


is?. 


1 1 1 


!!! 


g s ; is s s ! I s I 1 1 


5 '- 


II I I 1 


Is 


S 


mm 


■ ■■II 


! ^ i < ::!! 


! 


II 


ill 


11 


? 


if 


Is 


11 


1 


III! 


WO §9/20769 


iiliSill i S i i §_§ l_i i 
¥Ts : I I I I s 'a 


§ | s s 5 fe § g § 2 | - 


; : - " S II - 


8 1 § S I I a ? 


1 


Ill 


mm 


fffyii!: 


III! 


Ill 


s 


iiiiilii 


lilllSf 


5 


I 


II 


Ij 

s 


U! JOS! 


Ill 


III 


III 


mm 


Mil 


1 


111 I 


Is 


11 


WO 9*207,59 


91 


I 1 I I i 1 !§ § I I 1 


I I § 


i I i i 


If 11 


§ I 8 


S I S § 2 S P 


I 


§ 5 3 i s i g 


j 


1 


ill'i . ; 


III 


USUI 


IS! 


IlHIIIil 


:1 


II 


III 


III 


111 


III 


1 1 il I S 


III 
! 


il 


- 


I? 


ill 


il 


it 


IS 


I 


ill l l j 


111 1 1 


92 


9 9 

9 9 
z 


^ -J. 

S 5 


s § 

1 S 







1 

11 

o 

1 

11 

1 

a: cr. 

1! 

11 

IS 



11 

11 

o o 

11 

o 

11 

11 

11 

III 

11 


11 


WHA] ISCLAME? 

1 , A protease variant comprising replacing an amino acid at one or more 
residue doss; sns corresponding to res ^ mi „ t 

5 consisting of 62, 2 1 2, 230, 232, 252 and 237 of Batiks amyhimefadms rnbmh. 

a. The protease variant according to ciasm 1 which is derived from a Bacillus 

10 3. The protease variant according to claim 2 which is derived from Bacillus 

ientus %utmimn 

4. A DMA encoding a protease variant of claim 1 . 

15 5. An expression vector encoding the DNA of claim 4. 

8. A hos? tra ' sector or aasm 

7, A cleaning composition comprising the protease variant of claim 1 . 

20 

3. An animal feed comprising the protease variant of claim 1 . 


9. Ac 
i 1. 


30 1 1 

sheeted from the nrouo nons^inn m re^dae o-os^on* corre^nood'nn to positions ir; 
Table 2 of Bacsfes miyloiiquzfaciem subtiitsin. 


WO*»2©7«9 


12 Ths protease variant according to claim 10 comprising a su&stiiution set 
sheeted from trse group f } 3 o< i 

Table 3 of Sacmvs amyfomuefaci&m subiiissia 
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CONSERVED RESIDUES IN SUBTIUSINS -DOM 
BACILLUS AMYLOUQUEFACtENS 

1 10 20 

21 30 40 

. . T 0 8 * V K V A V . p ■ * G . , * * H 

41 50 60 

0 1 - . . , G G A 8 . V P . i . . . x Q D 

61 70 SO 

. N , H G 3? H V A G * : . A A L K « S 1 G 

81 90 100 

V J, G V A PSA. L Y A V K V L G A , G 

101 HO 120 

S G v . S . 1 . . G , S W A . N - . , , 

121 130 14 0 

V - . K . S 1 G . P S . S . . > . > A . ., . . 

3.41 150 160 

. „ . . . G V « V V A A . G K . G . . . 

161 170 180 , 

Y A V G A . 

181 ISO 200 

D , , N * * A S F S . -0 . . L D » « A 

101 210 220 

W Q V » - Q S T . P G . , Y „ . . K G T 

221 330 24 0 

S M A . P H V A G A A A h , . . X . . , 

241 250 260 

K, . ): |D!l, . ,LG. . 

- 2*1 270 

. . ¥ G ■. G X< . W: , - A A . . 
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